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Abstract 

We prove that the largest eigenvalues of the beta ensembles of random matrix 
theory converge in distribution to the low-lying eigenvalues of the random Schrodinger 
operator —-^i + x + -^b' x restricted to the positive half-line, where b' x is white noise. 
In doing so we extend the definition of the Tracy- Widom(/3) distributions to all /3 > 0, 
and also analyze their tails. Last, in a parallel development, we provide a second 
characterization of these laws in terms of a one-dimensional diffusion. The proofs 
rely on the associated tridiagonal matrix models and a universality result showing 
that the spectrum of such models converge to that of their continuum operator limit. 
In particular, we show how Tracy- Widom laws arise from a functional central limit 
theorem. 



1 Introduction 

For any /3 > 0, consider the probability density function of Ai > A 2 > • ■ ■ > A„, G IR given by 

PjCAx, A 2 , . . . , A n ) = ^e-«- ^ 4 J] | A, - X k f, (1.1) 
Zn j<k 

in which Z@ is a normalizing constant. When (3 = 1,2 or 4 this is the joint density of 
eigenvalues for the Gaussian orthogonal, unitary, or symplectic ensembles, G(0/U/S)E, of 
random matrix theory. For these special values of (3, the above model is solvable: all finite 
dimensional correlation functions may be computed explicitly in terms of Hermite functions, 
allowing for startling collection of precise local limit theorems for the random points (see [6] 
for background). 

The law (11. ip also describes a one-dimensional Coulomb gas at inverse temperature /3, 
and thus is of physical interest. In fact, (11.11) is intimately connected to Calogero- Sutherland 
quantum systems and hence to Jack polynomials (an important system of multiple orthogonal 
polynomials). Still, despite long being the focus of several branches of research, there are 
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no known forms of the (3 7^ 1,2,4 correlations which appear amenable to asymptotics. 
The forthcoming and comprehensive text [T7] contains an excellent account of the previous 
general-beta developments. 

It was therefore welcome news when, based on [29J, Dumitriu and Edelman [10] discovered 
the following family of matrix models for all j3. Let g±,g2, ■ ■ -g n be independent Gaussians 
with mean and variance 2. Let also Xj3i X2/3, • ■ ■ , X(n-i)p be independent x random variables 
indexed by the shape parameter. Then the n eigenvalues of the tridiagonal matrix ensemble 
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have joint law given by (11. ip . These are more specifically referred to as the /3-Hermite 
ensembles. 

We focus on the implications of this discovery to the point process limits of the spectral 
edge in the general /3-ensembles. The distributional limits of the largest eigenvalues in 
G(0/U/ S)E comprise some of the most celebrated results in random matrix theory due to 
their surprising importance in physics, combinatorics, multivariate statistics, engineering, 
and applied probability: [Tj, [3] [TB], [2D], [2T| . and [21] mark a few highlights. The basic 
result is: for f3 = 1, 2, or 4 and n f 00, centered by 2^fn and scaled by n 1 / 6 , the largest 
eigenvalue converges in law to the Tracy- Widom(/3) distribution (see [27] and |28j), which 
is given explicitly in terms of the second Painleve transcendent. There are allied results for 
second, third, etc. eigenvalues; see again J27J as well as [SJ. 

The wide array of models for which Tracy- Widom describes the limit statistics identifies 
these laws as important new probability distributions. Still, our understanding of these 
laws is in its infancy. It remains desirable to obtain a set of characterizing conditions, like 
those classically known for say the Gaussian or Poisson laws. A description of the limit 
distribution of the largest eigenvalues in the general /3-Hermite ensembles is a first step, 
providing additional information on the structure of the three Tracy- Widom laws via the 
structure of a one parameter family of distributions in which they naturally reside. 

Towards an edge limit theorem at general /3, Sutton |26j and Edelman and Sutton [12] 
present a promising heuristic argument that the rescaled operators 



H£ = nW (2yftl-H!) 



;i.3) 
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where I is the n x n identity, should correspond to 

n ^-& + x + W K (L4) 

in the n f oo, or continuum, limit. Here b' indicates a white noise, and the proposed scaling 
of the matrix ensembles follows the edge scaling in the known cases. Thus, were it to hold, 
the above correspondence would entail that the low-lying eigenvalues of iff converge in law 
to the those of Hp. Our first result is a proof of this heuristic. 

In Section[2]we give a precise definition of this limiting "stochastic Airy operator" (SAE^). 
For now, let L* denote the space of functions / satisfying /(0) = and J °°(/ / ) 2 + (1 + 
x)f 2 dx < oo. Then we say (ip,X) G L* x K. is an eigenfunction/eigenvalue pair for Hp if 
IIVH2 = 1 and 

r(x) = ^ ^{x)b' x + (x- X)^(x), (1.5) 
holds in the following integration-by-parts sense, 

v'(ao - v'(o) = ^(x)b x + 1*-^ dy + J*(y ~ x ^ dy > ( L6 ) 

where all integrands are products of locally L 2 functions. The set of eigenvalues is then a 
deterministic function of the random Brownian path b. 

Theorem 1.1. With probability one, for each k > the set of eigenvalues of Hp has a 
well-defined (k + l)st lowest element A&. Moreover, let Xi > X 2 > ■ ■ ■ denote the eigenvalues 
of the Hermite (5-ensemble H£. Then the vector 

(V/^v^-A^)) (1.7) 

V / i=\,...,k 

converges in distribution as n — > 00 to (A , A 1; ■ • • , A fc _x). 

We mention that though the n 1//6 scaling across all beta was anticipated from the Tracy- 
Widom results at (3 = 1,2,4, previous authors had only obtained bounds on this rate. For 
instance, [9] proves that the fluctuations of A^i are no greater than n 1 ^ 2 while, for even 
integer values of beta, [7j contains a suggestive calculation on the one-point function in a 
vicinity of the edge which produces the correct order. 

The proof of Theorem 1 1 . 1 1 relies on an equivalent variational formulation of the eigenvalue 
problem. As we shall show: with self evident notation, 

TWp — sup (-4 [°° f 2 (x)db(x)- r[f(x) 2 + xf 2 (x)]dx\, 
/e£MI/l| 2 =i LVP Jo Jo J 
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where the stochastic integral is again defined via an integration by parts procedure. The 
equality here is in law, and one may view this as a definition of TWp, independent of any 
random matrix theory developments. This variational approach actually allows for a type 
of universality result, and we provide rather general conditions under which the spectrum of 
random tridiagonal models converge to that of their natural continuum operator limit. In 
particular, one can obtain the Tracy- Widom laws by considering tridiagonal matrix versions 
of (11.41) far simpler than (jl.3p . 

Our next theorem gives yet another characterization of the limiting spectrum in terms 
of the explosion probability of the one- dimensional diffusion x \— > p(x) defined by the Ito 
equation 

dp(x) = -jg db x + (x - p 2 {x)) dx. (1.8) 

Theorem 1.2. Let k(x, •) be the distribution of the first passage time to — oo of the diffusion 
p(x) when started from +oo at time x. Then 

P(A > A) = «(-A, {oo}), and, for k > 0, 

P(A fc < A) = / K(-X,dx 1 )K(x 1 ,dx 2 ) ■ ■ ■ K(x k ,dx k+1 ). 

Even in the well-understood (3 = 1,2, and 4 settings, any such simple Markovian descrip- 
tion of TWp is novel. Further, a variant of (ll.8p is shown to describe joint laws in Proposition 

Theorem 11.21 combined with the variational picture leads to our final result on the shape 
of the general beta laws, so far only known for (3 = 1, 2, 4. 

Theorem 1.3. With TWp = -A (/3), for a | oo it holds 

F(TWp > a) = exp (-jj/3a 3/2 (l + o(l))), and 
v(TWp<-a) = exp(-^/3a 3 (l + o(l))). 

As indicated, the above discussion carries over to a large class of /3-ensembles. For 
example, let W be an n x k matrix comprised of independent standard real, complex, or 
quaternion Gaussians. Viewing W as k random vectors, the sample covariance matrix WW^ 
— known as the Laguerre, or Wishart ensemble for (3 = 1, 2, 4 — plays an important role in 
mathematical statistics. 

Johansson, in the complex case [20], and Johnstone, in the real case [2T], showed that 
the largest eigenvalues tend to the (3 = 2 and [3 = 1 Tracy- Widom laws whenever k/ti — > 
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d G (0,oo). Later, El Karoui [13J showed the same result holds even if k/u — > 00 or if 
both n, k, — > 00, this regime being important in applications. Our proof handles all regimes 
simultaneously for all (3. 

To explain, consider now the joint density on points Ai, . . . , X n G 

1 n 

P(L(A 1; A 2) . . - , An) = 4- J] |Ai - hf x J] A^ ^"V^. (1.9) 

Zn ' K i<fc fe=i 

When «; is an integer and (3 = 1 or 2 this is the joint law of Laguerre eigenvalues just 
described (granting k > n which can be assumed with no loss of generality by the obvious 
duality). Notice though that the above law is sensible for any real k > n — 1 and f3 > 0. As 
an application of our universal limit theorem, we also show: 

Theorem 1.4. Let Ai > A2 • • ■ denote the ordered j3 -Laguerre "eigenvalues" (jl.9p . and set 

(v/nK) 1/3 

(v n + V K ) ' 

Then for any k, as n — >• 00 wii/i arbitrary k = K n > n — 1 we /jave 

Cn,re(A*n,K — Af) ) =>■ ( A , A 1; . . . , A fc _ x 



In summary, we have a fairly complete characterization of the general beta random matrix 
"soft-edge" . Within the context of log-gas type ensembles, the present is the first paper to 
rigorously establish local fluctuation theorems of any type away from the classical /3 = 1, 2 or 
4 exponents on the Vandermonde interaction component of the density. Since its appearance, 
there have been several further advances in the general beta picture: [22] and [30] provide 
descriptions of the bulk spectrum, with |25j considering the so-called "hard-edge". The 
second and third papers rely in part on the results here, and based on [30] those authors 
have gone on to prove a refined form of Dyson's conjectures for the bulk eigenvalue spacing 
for all f3 > [31J. 

Once again, Section 2 discusses the definition and basic properties of SAE/3. The diffusion 
connection and Theorem II .21 are detailed in Section 3. Section 4 establishes, the tail bounds, 
Theorem 11.31 In Section 5 we prove a general result, Theorem 15.11 which provides weak 
conditions under which the lowest eigenvalues of tridiagonal random matrices of type discrete 
Laplacian plus potential converge to the corresponding eigenvalues of continuum operator 
limit. It is anticipated that a result of this type will be of future importance in investigations 
of universality in random matrix theory, and in Section 5 it is employed to prove Theorems 
O and [H 
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2 Basic properties of the Stochastic Airy Equation 



Definition of the Stochastic Airy Operator 

We use the usual Schwartz distribution theory. Recall that the space of distributions D = 
D(R + ) is the continuous dual of the space of all smooth compactly supported test 
functions under the topology of uniform-on-compact convergence of all derivatives. Recall 
as well that all continuous functions / and their formal derivatives are distributions. They 
act on via integration by parts, 

-^,/( fc V:= {-l) k J f{x)^ k \x)dx ) 

where the latter is clearly defined. For instance, b', the formal derivative of Brownian mo- 
tion, is a random distribution, as b is a random continuous function. The notation -<•,•>- 
distinguishes the above from an L 2 inner product (•, •). 

Introduce H^ oc: the space of functions / : M + — > K. for which f'lj e L 2 for any compact 
set I. SAE^ is then well defined as a random linear map H^ oc — > D, sending / to the 
distribution 

Hpf = -f" + xf + -fofV. 

As D is only closed under multiplication by smooth functions, one must make sense ft/ as 
an element of that space. Stieltjes integration by parts prompts fb' dx := — f£ bf dx + 
f(y)b y — f(0)b . The latter is a continuous function of y, and we define fb' as its derivative. 

Eigenvalues and eigenfunctions 

We will consider the eigenvalues from two points of view, 

(i) as the solutions of Hpf = \f with given "boundary conditions" , or 

(ii) as solutions of the usual variational problem. 

The first approach is intimately tied to the Riccati transformation, which will be our 
main tool for analyzing solutions. The second will be useful in obtaining bounds on the 
eigenvalue distributions for SAE^. Their equivalence for SAE^ is established over the course 
of the next two subsections. 

Recall the Hilbert space L* defined via the norm 

POO 

\\f\\l = / (/T + (1 + *)f dx, L* = {f: /(0) = 0, H/ll, < oo}. 
Jo 
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The framework just introduced makes it self-evident how to define the L* eigenf unctions and 
eigenvalues of SAE^ in the sense of (i). 

Definition 2.1. The eigenvalues and eigenf unctions of Tip are the pairs (/, A) £ L* x R 
satisfying 

Hpf = Xf, 

where both sides are interpreted as distributions (L* C C°(1R + ) C D, where C°(R + ) are 
those continuous functions on the half-line, vanishing at infinity). The (k + l)st smallest 
point in this set, if it exists, will be denoted A k . 

The / £ L* requirement allows us to avoid technicalities. Proposition 13.71 shows that it 
can be relaxed to / £ C° fl L 2 , or even further. 
If we rewrite TCpf = Xf as 

f" = (x-X + ^b f )f, (2.1) 

the right hand side is a derivative of a continuous function, and thus /' can be taken to be 
continuous (it is defined in an a.e. sense). In this way we arrive at two equivalent formulations 
of the eigenvalue problem. One is the coincidence of the integrated versions of both sides of 
(12. ip . The other is the equivalence of the two sides as distributions. The first reproduces 
the definition (11.51) from the introduction 



f(x)-f'(0) = J [(y-X)f-^b y f'(y)]dy +^f(x)b x , 

showing at once that /' inherits the H61der(l/2)~ continuity properties of b, i.e., that it 
is H61der(l/2 — e) continuous for all e > 0. Thus / £ H61der(3/2) _ . The second, weak 
definition, will be useful in the variational analysis. It reads 



J (p"fdx = J(x-X)(pfdx + J J b y f(y)dy-b x f(x 



(p'dx, (2.2) 



and takes the form 



J ifff dx = J(x- X)f(p - ^bf'tp - -fobfif/ dx (2.3) 

after an integration by parts. 

Before proceeding to the variational approach, we register some simple facts about L* . 

Fact 2.2. Any L* bounded sequence has a subsequence f n that converges to some f £ L* in 
each of the following ways: (i) f n — >£2 / (ii) f n — > /' weakly in L 2 , (Hi) / n — ^ / uniformly- 
on- compacts and (iv) f n — > / weakly in L* . 
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Proof. Convergence modes (ii) and (iv) are simple applications of the Banach-Alaoglu the- 
orem, (iii) stems from the familiar estimates f 2 (x) < 2||/ n || 2 ||/^||2 and \f n (y) — f n (x)\ = 
| J f^l[ XtV j dz\ < ||/^ || 2^ — 2/| 1,/2 ; showing that the sequence is uniformly equicontinuous on 
compacts. Last, (iii) implies L? convergence locally, while the bound sup n J xf 2 dx < oo 
produces the uniform integrability required for (i). □ 

The quadratic form -<f,T~Cpf>- 

The quadratic form -<f,H.j3f>- typically associated with self-adjoint operators is already 
defined for test functions / G Cg^. As a path to generalization, we proceed by decomposing 
the fluctuation term via 

rx+l 

b = b+ (b-b), b x = bydy, and so b' x = b x+1 - b x . 

J X 

The idea is to smooth out the noise and then later control the difference between the noise 



and its mollification. We have: for any / G C ( 



oo 
> 



f oo 

^f,b'y = I f 2 (x)b' x dx + I f(x)f{x)(b x -b x )dx. (2.4) 
Jo Jo 

The necessary extension requires this object to be finite over L*. Step one is a simple bound 
on the Brownian paths. 

Lemma 2.3. For b x , x > a Brownian motion, there is a random constant C < oo so that 

\h _ bJ 

sup sup — = = < C a.s. (2.5) 
x>o o< y <i y log (2 + x) 

As a consequence, \b'(x)\ V \b x — b x \ < C^\og(2 + x) . 

Proof. By the triangle inequality \b x+y - b x \ < \b x+y - bi x+y \ \ + \bi x+y \ - | + \b x - b\_ x \ \ 
it is enough to show that 

sup . < C, 

n >i Vlogn 

where the X n = sup 0<y<1 \b n+y — b n \ are independent and identically distributed. Further, 
P{X n > a) = 2a/2/7t e~ m2 / 2 dm < 2a~ 1 e~ a2//2 , courtesy of the reflection principle. A 
Borel-Cantelli argument completes the proof. □ 



x) 



Using the lemma, we may certainly bound \b' x \ by C(l + x) and \b x — b x \ by a/C(1 
An application of Cauchy-Schwarz in (12.4[) then yields 

Proposition 2.4. There is the bound H/ 2 ,6V| < C||/|| 2 for C = C(b) < oo a.s. In 
particular, -<f,TCpf>- is defined for all f G L* ; ^f,1~Lpg>- can be defined by polarization, 
and is a continuous symmetric bilinear form (L*) 2 — > R. 
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Variational characterization 

We are at last in position to define the ground state energy A = A (/?, to) of Tip variationally. 
As anticipated, we will show that Ao = Ao, the lowest point in the set of eigenvalues of Tip. 
Setting 

A :=inf{^/,^iV : / G L* with /(0) = 0, ||/|| 2 = 1} , (2.6) 

we begin by proving that Ao > 6*2(0;) > —00 a.s. From now on, random constants are 
denoted by C, and deterministic ones by c. 

Lemma 2.5. There are constants ci, 62,63 so that a.s. for all f G L* 

ciWfWl- c 2 \\m <*f,Hf,fy <C a \\f \\l (2.7) 

Proof. The upper bound is from Proposition 12.41 For the lower bound, we repeat the defi- 
nition 

/*oo /»oo 

-<f,H p fy = / (f) 2 dx+ / xfdx 

Jo Jo 

poo POO 

+7Pj Q f 2 (x)b' x dx+^j^ f'(x)f(x)(b x -b x )dx. 

The first line on the right amounts to ||/||* — \\f\\ 2 , an d it suffices to show that the terms 
Bi,B 2 in the second line do not ruin the picture and satisfy B\ + B 2 > — Ci||/||* — 6||/||| 
with Ci < 1. 



Lemma I2T51 provides the bounds ^3 1^4 1 < c 2 (6 + x) and ^p\b x — b x \ < c 2 \jC + x for 
arbitrarily small c 2 and some random 6 = C(c 2 ,b). Thus, < c 2 ||/|| 2 + C2C||/|||, and for 
the second term we have 



\B 2 \< / |f(x)/(x)|c 2 y6 T T^rfx<c 2 ||/ / (x)||2 + C2 / (6 + x)/ 2 (x)^. 
In particular, \B\ < 3c 2 ||/|| 2 + C"||/|||, where again c 2 may be taken as small as needed. □ 



Corollary 2.6. The infimum in the variational problem f |2.6j) zs attained at an eigenfunction 
fo of Tip with eigenvalue A . 



Proof. Again by Lemma [273| for any e > the bounds \b' x \ < e(l+x) and \b x — b x \ < ey/1 + x 
hold for all x > X(b, c). As in the proof of Lemma [2. 5 [ it follows 

-<S, Kpfy = 11/112 - ll/lla + ^ jf f 2 {x) b' x dx + ± J* f'{x)f{x){b x - 6.) dx + £, (2.8) 
where the error term satisfies \S\ < e 
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Next choose a minimizing sequence f n G L*: \\fnW2 = 1 with -<f n ,T~tpfn>~- —* A . By 
Lemma |2~31 ||/ n ||* < B for some B = Bib), and by Fact 12. 2\ we can find a subsequence along 
which f n —>■ /o uniformly on compacts, in L 2 , and also weakly in H . Terms 2, 3, 4 on the 
right hand side of (12.81) then converge to their evaluations at /, while term 1 plainly satisfies 
11/11* < liminf ||/ n ||*- Letting e — >• yields -<fo,'HpfQ>- < A , with the opposite inequality 
holding by definition. 

To complete the picture, taking the functional derivative ^-</o + £y?,^/3(/o + £^)>~\e=o 
of the variational problem (I2.6P in directions ip G Cg° shows that the selected minimizer 
satisfies TCpf = A /o in the sense of distributions. It is therefore an eigenfunction and 
Ao = Ao- □ 

The above formulation may now be used to define higher eigenvalues in the expected 
manner: Ai arising from restricting the class of potential minimizers to be perpendicular 
to /o, and so on. This defines a sequence of eigenvalues A&, k > 0, each of which is finite. 
That • • • < Afc < Afc + i < • ■ ■ is proved later, see Proposition 13 .51 The standard trick with 
the bilinear form defined in Lemma 12.41 yields the L 2 -orthogonality of the corresponding 
eigenfunctions (and, as a by-product, the uniqueness of the ground state found above). In 
short: 

Lemma 2.7. The (k + l)st lowest element A& in the set of eigenvalues of Tip exists and in 
fact A k = Ajfe. 

Remark 2.8. Lemma |2"T3"| as used in Lemma [2~51 is related to the second condition of our gen- 
eral convergence result, Theorem l5.lt below. In addition, a discrete version of this estimate 
is employed in the context of the /^-ensembles converging to SAE^. 

3 Riccati transform and diffusion formulas 

The Riccati map is a classical tool in the study one-dimensional random Schrodinger spectra. 
Its use dates back to Halperin [T9] who computed the density of states for —d 2 /dx 2 + b' x , 
though see also [18]. For other applications to local statistics such as the ground state energy, 
see 0], [5], and [23]. 

Return to the eigenvalue problem fll.5p 

r(x) = 4, ^(x% + (x- A)^(x), (3.1) 
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understood in the integration-by parts sense. The Riccati transform is simply the logarithmic 
derivative p(x) = ip 1 (x) / ip(x) . This turns (13 .ip into a first order differential equation: p(0) = 
oo, and 

p'{x) = x-\-p 2 {x)-^b'{x), (3.2) 

understood in the same way. 

Solutions of (13. 2p may blow up (to — oo) at finite times, as will happen whenever ip 
vanishes. In this case p is immediately restarted at +oo at that time-point in order to 
continue the solution corresponding to (13.11) . It is convenient to think of p as taking values 
in the disjoint union of countable copies of the reals, Mo;^-i;K-2 • • • Points (n,x) in this 
space are ordered lexicographically, though we sometimes refer to these points by their second 
coordinate x. A natural topology on this space is provided by the two-point compactification 
of each copy of the reals, glued together at the endpoints so as to respect the lexicographic 
ordering. (This ordering and topology can also be defined by considering the evolution of 
aig(ip' + it])) as a continuous, real-valued function, and applying the tangent map.) 

It is not hard to verify the following. 

Fact 3.1. The solution p\(x) = p(x,X) of (13.21) is unique and increasing in A for each 
x. It is also decreasing in x at each blowup (or "explosion"). Moreover, the function p is 
continuous when the image space is considered in the topology discussed above. 

Next consider the truncation 7i L of Tip, defined on the finite interval [0, L] with Dirichlet 
(ip = 0) boundary conditions at both endpoints. 

Lemma 3.2. Fix X, and denote (— n, y) = p(L, A). Then the number n of blowups ofp(x, A) 
to — oo on [0,L] equals the number of eigenvalues of 7i L at most A. 

Proof. We provide just a sketch. First, A is an eigenvalue of 7i L if and only if p\ blows up to 
— oo at the endpoint L. For large negative A, there are no blowups for any given noise path. 
As A increases, continuity and monotonicity implies that existing blowups move towards the 
beginning of the interval — new ones can only appear at the endpoint L. At those A we have 
a new eigenvalue and the claim follows. □ 

To extend the picture to the full line we need the following. 

Lemma 3.3. As L —>■ oo the first k eigenvalues Ai t0 , . . .A^^i of 7i L converge to the first 
k eigenvalues of Tip. 
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Proof. A trivial modification of the proof of Lemma 15.91 together with Fact 12.21 shows that 
liminf > A*,. Next, for an inductive proof, assume that the A^ — > Ai for £ < k. 
Let ff. be a function of compact support e-close to fk in L*. Let also 

k 

9l = 9L,k = fk~ /Afk, fL,e)fL,e- 
1=0 

The hypothesis entails that f^ -^ L 2 f e . So, for large L, each coefficient in the above sum is 
bounded by 2s, and Ql will be ce-close to fk in L* . Then, by the variational characterization 
we have 

limsupA^^ < limsup — r — , 

L^oo L^oo \gLi 9l) 

since /| is eventually supported on [0, L}. Then, as s — > 0, the right hand side converges to 
-<fk,Hf,f k y/(fkJk) = A fc . □ 

Taking the L — > oo limit of the claims of Lemma 13.21 then yields 

Proposition 3.4. Let N(X) be the number of blowups of the equation (13. 2p to — oo. Then 
for almost all X, N(X) equals the number of eigenvalues of Tip at most X. In other words, 
the cadlag version of N(X) is Tip 's eigenvalue counting function. 

Of course, for any fixed A, the Riccati equation may be taken in the Ito sense, 

dp(x) = —-jjj db x + (x — X — p 2 (x)) dx, (3.3) 

which is to say that p = p\ = (log^)' performs the indicated diffusion, restarted at +oo 
instantaneously after each explosion to — oo. The content of the above is that the total 
explosion count equals the count of eigenvalues < A. 

Proof of Theorem [77E The strong Markov property of the motion (13.31) implies that the 
sequence of explosion times, mo = 0, mi, TTI2, ... is itself a Markov process. Let k(x, ■) be the 
distribution of the first such time of po(x) under P(oo,a;), that is, when started from 00 at 
time x. This law is supported on (x, 00] with a point mass at 00. By the preceding, 

P(Afc_i < A) = P( oo )(pa(^) has at least k explosions) 
= P(oo,-A)(po(a ; ) has at least k explosions) 
k(— X, dxi)n{xi, dx 2 ) ■ ■ ■ k(xi~-i, dxk). 



The second equality uses the obvious translation equivariance of p. □ 
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In addition: 



Proposition 3.5. The Tip eigenvalues are a.s. distinct with no accumulation point. Es- 
pecially, Afc — > oo as k — > oo a.s., and the minimization procedure of the previous section 
exhausts the full Tip eigenvalue set. 

Proof. The distinctness follows from the fact that P(iV(A+) — N(X) < 1) = 1. Next, starting 
at any time x, there is positive probability k(x, {oo}) that po will converge to oo without 
blowing up to — oo. Monotonicity implies that k(x, {oo}) is increasing in x. Thus the 
number of eigenvalues below A is dominated by a geometric random variable with parameter 
k(— A, {oo}), whence it is finite. The claim follows. □ 

We mention that second half of Proposition 13.51 may also be established by making sense 
of the resolvent operator of and showing it maps the unit ball in L* onto Holder (3/2) ~ 
functions vanishing at infinity. The simplicity of the above proof demonstrates the advan- 
tages of the diffusion picture. 

This connection between the limiting top eigenvalues of the random matrix ensembles and 
the explosion time of a simple, one-dimensional diffusion is new even in the deeply studied 
cases of (3 — 1, 2 or 4. We now recall the formulas of Tracy- Widom which, in conjunction 
with our result, produce the identities 

expf— | J A °°(s — \)u 2 (s) ds^j exp^— | f£°u(s) ds*j, (3 = 1, 
P(oo,A)(mi = oo) = < expf-/ A °°(s - \)u 2 (s)ds S ), (3 = 2, (3.4) 

exp^— | Jy(s — X')u 2 (s) ds^j cosh( f_^° u(s) ds), (3 = 4. 

Here, u{s) is the solution of u" = su + 2u 3 (Painleve II) subject to u(s) ~ Ai(s) as s — > +oo, 
and A' = 2 2 / 3 A in the (3 = 4 distribution. An important problem for the future is to obtain 
the equivalent of the closed Tracy- Widom formulas for general (3. Even a direct verification 
of (13.41) would be interesting. 

Remark 3.6. The diffusion ( I3.3P seems efficient for simulating Tracy- Widom distributions as 
well as distributions of higher eigenvalues. First note that for x < 1, p(x) comes down from 
+oo like 1/x. Also, the more time accumulated inside the parabola p\ = {p 2 (x) < x + X}, the 
less likely explosion becomes. That is, the typical path which hits — oo does so by tunneling 
through the narrow part of p\. In line with these heuristics good simulations of the general 
TW/3 distributions may be obtained by tracking the explosion probability for p(x) begun at 
say p(0) = O(10 3 ) and run only for 0(1) time. 
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Moving to applications of the Riccati map to the eigenf unctions, a detailed but standard 
analysis of the diffusion (13. 3p using domination arguments shows that for each A, with prob- 
ability one, p(x)/^/x — > 1 (after a finite number of initial blowups and restarts). By Fubini, 
this holds for almost all A. By monotonicity, this happens for all A except for eigenvalues. 
Thus we get 

Proposition 3.7. A.s. for all solutions f G Hl oc oflipf = A/, with f(0) = we have the 
following. If f is not an eigenfunctions then f (x) / (f(x)^) — > 1. In particular, if f grows 
slower than exp((2/3 — e)x 3 ^ 2 ) then f is an eigenf unction. 

Remark 3.8. Note that the results of this section, and indeed, the entire paper, easily extend 
to general initial boundary conditions. 

We conclude this section with a decay bound on the Hp eigenfunctions. Compare this 
to the noiseless (/3 = oo) limit in which case all eigenfunctions are simply shifts of the Airy 
function Ai(-). 

Proposition 3.9. Let f be any eigenfunction of Hp. Then, for any e > there is a random 
constant Cf t£ so that 

\f(x)\ <C /i£ exp(-(2/3-e)x 3 / 2 ). 

with probability one. 

Proof. Let p, q be the solutions of (13. 2\\ corresponding to Dirichlet and Neumann boundary 
conditions at (i.e. g(0) = 0), with p = f'/f for the specified eigenfunction /. Then q 
cannot correspond to an eigenfunction of the Neumann problem and so by Proposition 13.71 
(in conjunction with Remark l3.8p q(x)/y/x — > 1. From the differential equation (13.21) we 
have 

— ( q - p ) = 

and so 

(q-p)(x) = Cexp (^j ~(q + p)(y)dy^j , (3.5) 

for m < oo some random time past the final explosion of q with p(xn) finite. With the 
notation Q(x) = J q(y) dy, P(x) = f x p(y) dy, and R(x) = Q(x) — P(x) the above reads 

R'( x ) = Cexp(R(x) - 2Q{x)). (3.6) 

Now Q(x) = 2/3x 3 / 2 (l + o(l)) and P{x) = C + log \f{x)\ -> -oo, implying 

R{x) - x 3/2 -> oo. (3.7) 
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To finish, it suffices to show that for all e > and x > Xq(e) we have R(x) > 4/3(1— e)x 3 ^ 2 . 
Assume the contrary. Then we can find Xq large so that the right hand side of (13.61) at x = Xq 
is at most 1 and for x > Xq we have q(x) > 1/2. We claim that the solutions of the ODE 
(13. 6p started x = x are dominated by the solutions of R'{x) = 1. Indeed, when R(x) < R(x) 

Cexp(R(x) - 2Q(x)) < C exp(R(x) - 2Q(x)) 

= C exp \R(x ) - 2Q(x ) + £ 1 - 2q(y) dyj < 1 

so that the monotonicity can be maintained. Thus R(x) < R(x) = C + x for all large x, 
contradicting (13 .7p . □ 



4 Tracy- Widom tail bounds: an application of SAE^ 

This section contains the proof of Theorem 11.31 

With TWp = -A (/3) now defined by (I2"B . both the upper bound on ¥{TWp < -a) and 
the lower bound on P(TWp > a) follow from suitable choices of test function / in 

-<LH p fy = J o [f{x) 2 + xf\x)}dx + -j=J^ f{x)db x >A,\\f\\i 

The other two bounds run through the Riccati correspondence. 

Lower bound, right tail. Begin with the observation that 




for any choice of / £ L*, and g a standard Gaussian variable. Here we have used the rule 
that for h deterministic, hdb is a centered Gaussian with variance WhW 2 ,. 

Wishing to maximize this probability, the observation is: for there to be a large negative 
eigenvalue the random potential, and then so also the eigenfunction /, should be localized. 
This leads one to neglect the J °° xf 2 term and look for the maximizer / of the expression 
entering the Gaussian tail, namely 

a 00 / 4 ) 172 ' 

Viewing this problem on the whole line and neglecting boundary conditions, the optimiz- 
ers can be computed exactly: f(x) = Cisech( v /ax + c 2 ). 
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Note that sech. 2 (x)dx = 2, (sech'(x)) 2 dx = 2/3, and sech 4: (x)dx = 4/3. Let 
f(x) = sech(y / a(x — 1)). Then, on IR + , with ~ denoting asymptotics as a j oo we have 

oil/Ill ~2Va, llv^/lll = O(^), ||/||^-^. 

Further, while /(0) 7^ 0, it decays exponentially there as a — > 00, allowing for an admissible 
modification which shares the above evaluations. Returning to (14. ip we find that 

¥(TWp>a) > P^x A a -i/4 s< _ a i/2^ + 2 + o(1 Ay 
producing the desired bound from the simple Gaussian tail estimate P(g > c) = e~° 2 ^ 2+ °^ . 



Upper bound, left tail. The reasoning is the same as that just employed, though in 
minimizing the right hand side of ¥{TWp < —a) < F(-^,f,TCpf>- > a(f, /)) one expects the 
optimal / to be relatively "flat". Neglecting the J {f') 2 term leads to the choice 

f(x) = (xi/a) A \/(a — x) + A (a - x) + . 

The middle term is dominant, while the others control ||/'||2- Then 



^~y, \\f'\\l = 0(a), \\Vxf\\l ~ \\f\\t~j. 



The proof is completed by substitution 



¥(TW p <-a) < F^xia 3 / 2 fl >a 3 (^+o(l)jj = exp (-^ a 3 (l + o(l))) . 

Lower bound, left tail. For this we use the diffusion description of Theorem 11.21 namely 
FiTWp < —a) = P( 00> _ a )(p never explodes), where p is the diffusion ( 11 .81) . and the subscript 
indicates the space-time starting point. By monotonicity, 

P(oo-a) (p never explodes j > P(i,- a ) (p never explodes j 

> P(o,-a) (p( x ) £ [0, 2] for all x G [—a, 0]jP( 0i o) [p never explodes J . 

The last factor in line two is some positive number not depending on a. To bound the first 
probability from below, employ the Cameron-Martin-Girsanov formula 

P(i-a) (p{x) G [0, 2] for all x G [-a, 0] 

= E ( i _ ) exp (-^ J (x - b 2 x )db x - ^ J (x - b 2 x ) 2 dx^j ; b x G [0, 2] for all x < 

16 



where b x is a Brownian motion with diffusion coefficient On the event in question, 



and 



J_ (x-b 2 x ) 2 dx = ^a 3 + 0(a 2 ), 



r° 1 a r° 

j (x - b 2 x )db x = ab- a + -(b 3 _ a - b%) + (p-l) J b x dx = 0(a). 



To finish, note that P(- Oj o)(&a; [0,2] for x < 0) > e ca , and so does not interfere with the 
leading order. 

Upper bound, right tail. We thank Laure Dumaz for helping correct an earlier, flawed 
attempt. The present proof is based on her master's thesis [11] . which contains more precise 
estimates. Write, for a ^> 1 and a large c > to be determined, 

¥{TW P >a)= PooCm-oo < oo) < P^„ c (m_^ < oo), (4.2) 

where m denotes the passage time to the indicated level of the process 

2 

dp(x) = —=db x + (a + x — p 2 (x))dx. 

VP 

To bound the rightmost probability in (14.21) we introduce a further control on the paths, and 
show that there is a numerical constant d so that 

F v^-i( m -v^ < °°) < c ' P ^-i ( m -V^ < 00 > m v^ > m -V^) ■ ( 4 - 3 ) 

This is accomplished by two applications of the (strong) Markov property. From now on we 
denote m± = m ± and A = {m + > m_}, and have 

P^- c (™- < oo) = P^_ c (m_ < oo,.a) +P >/5 _ c (m_ < oo,„4 c ) 

< P^- c (m_ < oo, A) + P^ (m_ < oo) 

< ^- c ( m - < °°' A ) + p ^( m v^-c < oo)P^_ c (m_ < oo)- 

Both inequalities use the fact that the hitting probability of any level below the starting 
place is decreasing in the staring time of the diffusion p. The desired bound (14. 3 j) then lies 
in the following claim, the proof of which we defer to the end. 

Claim 4.1. There exists a large enough c so that Pys( m ya-c = °°) ^ s uniformly bounded 
below (i.e., independently of a^> c). 
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We proceed by performing a change of measure, 

P>- C (m- < oo,^lJ = ME^ R{q,L),m+ < m_ < L , (4.4) 



where g is the diffusion with reversed drift, 

2 



dg(x) = —=db(x) + (g 2 (x) — x — a)dx, 
VP 



logi?(g, L) = ^- I {a + x — q 2 {x))dq{x). 



and the Cameron-Martin-Girsanov factor is given by: 

»LAm_ 

An application of Ito's lemma shows that, for any 2 > 0, 

(a + x-g 2 (x))dg(x) = a(q(z) - g(0)) - ^(q 3 (z) - g 3 (0)) (4.5) 







+ zq(z) + (4/P-l) q{x)dx. 







For 2; < m_ A m + , then \q{x)\ < y/a for x £ [0, 2] so the first line is bounded by a function 
of a, and the second line is bounded by (1 + 14//3 — l|)-y/am_ = Ciy/am_. We will show that 

logE^_ c [e Cl ^, m+ < m _] = (a 3 / 2 ). (4.6) 
Letting L — > 00 in ( 14. 4p . the dominated convergence theorem combined with ( 14.51) gives 

P(T^ > a) < c ' e -|^ 3/2 (i+°d)) ; (4.7) 

since for z = m_ and g(0) = y/a — c the first line of (14. 5 j) equals — (4/3)a 3 / 2 + 0(a). 

Next recall that the expectation (14. 6 p is in terms of the g-diffusion, and notice that on 
the event {m + < m_} the g-drift is bounded above by —x. Therefore, if we introduce the 
process q(x) = -^b(x) — x 2 /2 and let m = inf{x > : q(x) = — y/a}, it is enough to show 
that logE^e ^™] = o(a 3 ^ 2 ) for any constant c. For this, the simple bound 

P^(m >t)<F v - a (jpbit) >t 2 -^)< e~l t3 for t > 4(1 + -^)y/a~ 

will do the job. 

To finish, we return to the proof behind the key bound ( 14.31) . 



Proof of Claim \£T\ Certainly, the probability that p, begun at y/a, never reaches 



a — c 



is bounded below by the same probability for p replaced by its reflected (downward) at 
y/a version. Further, when restricted to the interval [y/a — c, y/a] , the p-diffusion has drift 
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everywhere bounded below by x. Thus we may consider instead the same probability for the 
appropriate reflected Brownian motion with quadratic drift. 

To formalize this, it is convenient to shift orientation. Let now p(x) = -^b(x) — x 2 /2, 
and let p*(x) denote p reflected (upward) at the origin. Namely, 

p*(x) = p(x) — inf p(y). (4.8) 

y<x 

If we can show that for a large enough c, F (p*(x) never reaches c) > 0, then the p-probability 
in question will also be bounded below, independent of a. 

What we actually prove is that M = sup x>0 p*(x) is a.s. finite. Let re > 1, and for each 
n > define the event 

D n = jp(x) hits (1 — n)re for some x between m_ ntt and m_( n+ i) re |. 

^From the representation (14.81) one sees that M > 2k implies that some D n must occur. 
Define as well the event 

A K = {p(x) > -^(^ 2 + 1) fo r all x > o|, 

and note that, on A K , m_ nre > \J1n — 1. Hence, still on A K , for all n > 1 the shifted process 
p(m_ nK +x)— p(m__ nK ) is dominated by the scaled Brownian motion x t— > -^b(x) plus constant 
drift -(1/2) V2n- 1. It follows that: for n > 1, 

F(D n n A K \F m _ nK ) < F (x^ {^b(x) - \^2n - 1 x) hits k before - re) (4.9) 

1 

_ I _)_ e «;(/3/2)v / 2S=T ' 

Putting the above together we have 

p(m>2k) < p(^) + P(A (t n(u n > i5 n ))<P(^)+P( J Do) + ^P(An^ n ). 

n>l 

The sum of the series converges to by (14. 9p . Since -D implies that sup x>0 p(x) > re, both 
P(^4£) and P(-Do) tend to zero as re — > oo by the Law of the Iterated Logarithm. □ 

5 Convergence of discrete models and universality 

This section establishes a general and rather weak set of conditions under which the bottom 
eigenvalues of random symmetric tridiagonal matrices converge to the bottom eigenvalues of 
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a corresponding stochastic differential operator. In many ways this is the central result of 
the paper. 

To explain the setup, consider a sequence of discrete-time R 2 -valued random sequences 
((yn,i,k,y n ,2,k)] 1 < k < n). Let m n = o{n) be a scaling parameter. (In the particular case 
of the Hermite /3-ensembles we have m n = n 1 ^ 3 .) For each n, we build an n x n tridiagonal 
matrix H n . 

Let T n denote the shift operator (T n v)k = Vk+i acting on Ei x 1 2 . . .. Let (T*v)j- = 
ffc_ilfc>i be its adjoint, and let R n denote the restriction operator (R n v)k = Vklk<n- Let 
also A n = m n {l — T^) be the difference quotient operator, and finally set 

(Tn + T^y (5.1) 

where the subscript x denotes element-wise multiplication by the corresponding vector. Then 
H n maps the coordinate subspace ]R n — > K n , and its matrix with respect to the coordinate 
basis in M n is symmetric tridiagonal with (2m n + m n (y n ,i,k — Vn,i,k-i), k > 1) on the diagonal 
and (— m n + m n (y n ,2,k — y n ,2,k-i)/2, k > 1) below and above the diagonal. Roughly speaking, 
H n is the discrete Laplacian plus integrated potential y nt i + y n ^- 

Additionally, define y n ,i(x) = yn,i,[xm n \^ X m n e%n]- By choice, {^fm^ x y n;i;k ,k > 0) is on 
the scale of a simple random walk, so no additional spatial scaling will be required. 

Our basic convergence result rests on two sets of assumptions on the processes y n ,i=i,2- 

Assumption 1 (Tightness/Convergence) There exists a continuous process x \— > y(x) such 
that 

(y n ,i(x); x > 0) £ = 1,2 are tight in law, 
{Vn,i{x) +y n>2 {x); x > 0) (y(x);x>0) inlaw, (5.2) 

with respect to the Skorokhod topology of paths, see [T5] for definitions. 

Assumption 2 (Growth/ Oscillation bound) There is a decomposition 

k 

y n ,i,k = m^^^n^e + w n ,i, k (5.3) 
e=i 

for T] n ,i,k > 0, deterministic, unbounded nondecreasing continuous functions f](x) > 0, ((x) > 
1, and random constants n n {u) > 1 defined on the same probability space which satisfy the 
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following. The tight in distribution, and, almost surely, 

fj(x)/K n - K n < r} n>1 (x) +77„, 2 (x) < K n (l + fj(x)), (5.4) 

VnM < 2m 2 (5.5) 

kn,i(0 - wvi(x)| 2 + |tWn, 2 (0 - w n:2 (x)\ 2 < K n (l+fj(x)/((x)). (5.6) 

for all n and x, £ G [0, n/m n ] with |x — £| < 1. 

We may now define the limiting operator. Just as in Section [2] we note that 

H=--f- 2 +y'(x) (5.7) 
dx z 

maps if^,, to the space D of distributions via integration by parts. Without changing the 

notation, we generalize the Hilbert space L* C L 2 (M + ) introduced there. This consists of 
functions with /(0) =0 and 

POO 

11/112= / f(x) 2 + (l + r ? (x))/ 2 (x) C /x<oo. 

The eigenvalues and eigenfunctions are defined again as (A, /) G 1 x I* \ {0} with ||/||2 = 1 
satisfying (15. 7p . Recall from Section [2] that this means 




-y(z)f'(z)-\fdz + f(x)y(x), 



or, equivalently, for all ip G it holds 

J f(f"dx = J -Xfip - yf'tp - yfip' dx. 

With the picture laid out, a few words on Assumptions 1 and 2 are in order. The former 
simply asks for correspondence between H n and H at the level of integrated potentials. The 
latter, more technical condition, will imply the compactness necessary to maintain discrete 
spectrum as n f 00. 

Theorem 5.1 (Convergence in law). Given Assumption 1 and 2 above and any fixed k, the 
bottom k eigenvalues of the matrices H n converge in law to the bottom k eigenvalues of the 
operator H. 

We will also show that, after a natural embedding, the eigenfunctions also converge in 

L 2 . 
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Proof: Reduction to the deterministic case. It will be convenient to find subsequences along 
which we have limits for all desired quantities. 

The upper bound (15.41) shows that (J* rj n<i {t)dt; x > 0) is tight in distribution for i — 1,2. 
For any subsequence we can extract a further subsequence so that we have joint distributional 
convergence 

PX 

( / r) nii {t)dt\ x > 0) (vj(t)dt; x > 0), 
Jo 

(Vn,i{ X )l X >°) (Vi( X )l X > °)> ( 5 - 8 ) 

where the convergence in the first lines is in the uniform-on-compacts topology, and the sec- 
ond, in the Skorokhod topology. Then by Skorokhod's representation theorem (see Theorem 
1.8, Chapter 2 of we can realize this convergence as a.s. convergence on some probability 
space so that the conditions of Proposition 15.21 below are satisfied with probability one. 

Note that (I5.4H5.5I) are local Lipschitz bounds on the J r] nt i, and so they are inherited by 
their limit rjj. Thus rji = (r/l)' is defined almost everywhere, and satisfies (I5.4H5.5I) . Further, 
r/i can be defined everywhere so that (I5.4H5.5I) continues to hold. 

It also follows that each w n ^ = y n ,i — Vn,i must have a limit which we denote Wi. We 
further denote w = w\ + W2 and r] — rji + 1]2. The claim now follows from Proposition I5.2I 
below. □ 

Proposition 5.2 (Deterministic convergence). Assume that the each of the convergence 
statements in Ii5. 8\) hold deterministically and that the bounds ( |5.^j5. 6\) hold with some de- 



terministic constant k replacing K n . Then, for any k, the lowest k eigenvalues of the matrices 
H n converge to the lowest k eigenvalues of H. 

In the next subsection, we establish some properties of the limiting operator. Afterwards, 
we prove Proposition I5.2I 

Properties of the limiting operator 

Just as in Section [21 we extend the bilinear form -<-,H->- from x L* to L* x L*. We 
want to define the extension as -<f,Hfy := J f' 2 + r\f 2 dx + j f 2 dw (with -<f,Hg>- then 
defined by polarization), but we still need to define and control the last term. By the next 
lemma, this can be done via the integration by parts already employed in (12. 4ft : 

roc poo poo rx+l 

/ f 2 ( x )dw x = f 2 (x)(w x+1 -w x )dx + 2 f{x)f(x)( w t dt-w x )dx. (5.9) 

JO Jo Jo Jx 
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Lemma 5.3. The integrals on the right of A5.9\) are defined and finite for f e L*. Moreover 
there exist c 8 , c 9 , c w > so that 



cs\\f\\l-c 9 \\f\\l<^f,Hfy<c w \\f\\l. 

Proof. By taking limits of the inequalities 05.4H5.6p on 7]i^ n and w n we get bounds for rji and 
w. In particular max(|w x+ i — w x \, \w x+ i — w x \ 2 ) < c £ + ef] where e can be made small. Now 
we write -</, Hfy = A + B where B is the fluctuation term (15. 9p . and the potential term 
satisfies - c ||/||| < A < c\\f\\l To bound B, first write 

POO 

/ / 2 0) \w x +i - w x \ dx < (f, (c e + e)fj)f). 
Jo 

For the second term, we average the inequality sup^.gi^ \w^ — w x \ < \c £ + e^(x)| 1//2 and use 
an inequality of means: 

POO fX+1 -1 

2 / \f'(x)f(x)( w t dt-w x )\dx< ^i\\f'\\l + (f—( C£ + sf})f). 
Jo Jx v £ 

The above inequalities give \B\ < 2y/e\\f\\1 + t^H/IH- Setting e small we get the results. □ 

The bounds immediately imply the following. 

Corollary 5.4. (i) The bilinear form -<-,H->- : L* x L* — > R is continuous, (ii) It does not 
depend on the decomposition y = w + j rj. (Hi) The eigenvalues and eigenf unctions (A, /) of 
H satisfy -<g, Hfy = X(g, f) for all g G L* . (iv) In particular, -</, Hfy = X(f, f). 

Proof of the Corollary. Since L* C L 2 is a continuous embedding, it suffices to prove that 
-<■, H-y+ eg (•, •) is continuous. This form is nonnegative definite by the lemma. Continuity 

(i) now follows from Cauchy-Schwarz applied to the form and the bounds of the Lemma. For 

(ii) and (Hi) approximate g by smooth compactly supported functions and use continuity. □ 

Together, these two statements provide an analogue of Lemma 12.51 in a more general 
context. In particular, the (discrete) eigenvalues of H may now be defined variationally. 
The arguments used in Section [2] in conjunction with the limiting bounds of (l5.4H5.6p give 
the following. 

Lemma 5.5. The lowest k elements of the set of eigenvalues of H exist and admit the 
variational characterization via the bilinear form -<f,Hfy. 
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Tightness 

Next, we define the discrete analogue of the norm || • ||*. For v G W 1 let ||u||| = m~ l X^=i v l 
with scaling to match the continuum norm. Let f] n ^ = f](k/m n ) and let 

IHL=||A B t;||»+||(^,) 1 / a t;||5 + ||t;||» (5.10) 

and note that II v II 2 < IMLn- We continue with a bound on H n . 



Lemma 5.6. Assume the bounds ([5.^15. 6\l . Then there are constants Cn,ci2,ci3 > so that 
for all n and all v we have 

ClllMlL - C 12|Ml2 < ( V i H nV) < C 13 \\V \\l n . 

Proof. Drop the subscript n, and recall the definition of the difference quotient Av^ = 
m{yk+i — Vk). We recall the following consequence of the discretized bounds f l5.4H5.6j) : 

fjk/K- K < T) 1)k + T)2,k <Kf] k + K 

f]2,k < 2rn 2 , 

\ w i,t ~ w i,k\ 2 — £ Vk + c e , k < £ < k + m, i—1,2. 

Here e can be arbitrarily small at the expense of c £ . Let w k = w^k, u k = W2,k- By definition 
of H n , 

n n 

m(v, H n v) = ((Av k ) 2 + r} 2 ,kVkVk+i + m,kvl) + E (^ W k v k + Au k v k v k+1 ) . (5.11) 

k=0 k=0 

Let A, B denote the two sums. Using the inequality ab > —{b — a) 2 /3 + a 2 /4, we obtain the 
following lower bound for the second summand in A: 

V2,kV k V k +l > ^-{Vk+1 ~ v k ) + — — > -- (Av k ) + — — , 

3 4 3 4k 4 

thus we get A > j*-\\v \\ 2 — cm\\v || 2 . We also clearly have \A\ < cm\\v\\ 2 . To bound \B\, set 
5w k = (w k+m — Wk). Summation by parts gives 



Aw k v 2 k = ^2 ( _ 5w k) v i+^2 5w k v i 

k=0 k=0 k=0 

n / k+m \ n 



fc=0 V=fc+1 / fc=0 
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By the bound on w the absolute value of the first summand in (15.121) is not more than 

m (efj k + c £ ) 1/2 \vl +1 -vl\<-p (ef) h + c £ ) (v k + v k+1 ) 2 + y/em 2 (v k+1 - v k ) 2 . 
Together with the bound on the second sum in (15.121) this yields 

n 



m 



k=0 



< {-1= + 1) (efj k + c+e) v\ + yW J2(v k+1 - v k ) 2 . (5.13) 

V k=0 k=0 

The argument for u starts with the same summation by parts, with v k v k+ i playing the part 
of v\. After bounding the u terms, we use the inequalities 2\v k v k+ i\ <v 2 + t>j? +1 and 



\vkVk-i - v k v k+ i\ < \v k \\v k - v k -i\ + \v k \\v k+ i 



together with Cauchy-Schwarz to get an estimate of the form (15. 13ft for YHt=o Au k v k v k+ i. 
Thus we find that \B\ < c-s/em\\v\\ 2 + c' £ m\\ v\\\. For e sufficiently small the claims follow. □ 

Operator convergence 

Embedding vector spaces. We embed the domain R™ of H n in L 2 (R+) in an isometric 
way, identifying oGl" with the step function v(x) = v^ mnX -\ supported on [0, n/m n ]. Let L* 
denote the space of such step functions, and let V n denote the L 2 -projection to this space. 
Let (T n f)(x) = /(x + m^ 1 ) denote the shift operator, and let R n (f) = /l[o,n/m n ] denote the 
restriction. Let A n = m n (I — T*). These operators are simply extensions of the already 
defined action of T n and A n on L* n . Thus the formula (15.11) extends the definition of H n to 
L 2 . 

It is easy to check the following: (i) V n and T n , and so A n commute; (ii) for / G L 2 we 
have V n f — > / in L 2 ; and (iii) when /' G L 2 and /(0) = we have A n f — >■ /' in L 2 . 

Lemma 5.7. Assume that f n G L* n and f n —>f weakly in L 2 and A n f n — ► /' weakly in L 2 . 
Then for all if G we have (if, H n f n ) — > -<<f, Hfy. In particular 

(Vnf, H n V n f>) = (f>, H n V n f) -> <f, Hfy. (5.14) 

Proof. Because we are dealing with ip of compact support, may drop the restriction part R n 
for H n . The convergence 

(<p, A n A n f) = (A n A n f, f) - (if", f) = -«p, /V 
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is clear, so it remains to check the potential term. First note that if J is a finite interval, 
and g n — g and h n — > h is L 2 (J)-bounded and converges weakly in L 2 (I) then 

(g n ,h n lj) -> (g,hlj). (5.15) 

Let / be a finite closed interval supporting A n <£>, <// and (p. The potential term is 

(ip, ((A n y njl ) x + (A n2/n>2 ) x |(T n + T l n )) f). 

Setting y n = y n i + y n>2 , we first approximate the right hand side by 

((p,(A n y n ) x f n ) = (A{,(p/ n ),y ft ) 

= (^A* n /„ + / n A^ + m^A^A^, 2/ n ) 

= (A*„/ n , Wn ) + (f n , yj^tp) + m-^A'jn, ynK^). 

The first two terms in the above converge to the desired limits by (15.151) . and the last one 
converges to because it is bounded without the extra scaling term. The error term in the 
above approximation comes as a sum of T n and terms; we consider twice the T l n part: 

\(tp, (A n y nt2 ) x (I - T*)f n )\ = \{^m^-A n y nj2 ,A n f n )\ 

< \\m- l A n y n ^li\\2 ||A n / n || 2 sup \ip{x)\. (5.16) 

Now m~ l A n y n ^i is the restriction to I of the difference y n ^ — Tfy n ,2i m which both terms 
converge to y 2 in the Skorokhod topology. In particular, they converge a.e., and since they 
are locally bounded, their difference converges locally in L 2 to 0. This shows that (15.161) 
vanishes in the limit. We handle the T n term similarly. □ 

Lemma 5.8. Recall the discrete \\ ■ ||* n norm from A5.10\) . Assume that f n G L* n , ||/ n ||* n < c, 
and \\fnW2 — 1- Then there exists f G L* and a subsequence so that f nk — f and for 
all (p G C£° we have ((p, H n J nk ) -> -<Lp, Hfy. 

Proof. Since f n and A n f n are bounded in L 2 , we can find a subsequence along which f n — > 
/ G L 2 and A n f n —> f e L 2 weakly. Considering (A n f n , l [0 ,t]) we get that f f = f, that is / 
has a differentiable version and / = /'. The bounded nature of the fj terms in the L*-norm 
gives sufficient tightness so that we have / G L* and /„ — f. The last part then follows 
from Lemma [5.71 □ 

We break up the proof of Proposition 15.21 into two Lemmas. Let (X n> k, v n> k), k > be the 
lowest eigenvalues and the embedded normalized eigenfunctions of H n , and let (A*., fa) be 
the same for H. 
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Lemma 5.9. For k > we have X k = liminf Afc ra > A k . 

Proof. Assume \ k < oo. Since the eigenvalues of H n are uniformly bounded below, we 
can find a subsequence so that (A n) i, . . . , A nj fc) — > (£i,...,£fc — Afc)- By Lemma |5~6| the 
corresponding eigenfunctions have L* norm uniformly bounded, and Lemma I5T81 now implies 
that for a further subsequence, their L 2 limit exists. Moreover, by the same lemma this 
limit must consist of orthonormal eigenfunctions of H with eigenvalues at most A fc . The 
orthonormality of the eigenfunction set shows that they correspond to k distinct states and 
the proof is finished. □ 

Lemma 5.10. For k > we have X k>n — > A& and v n ^ -^l 2 f it- 
Proof. For an inductive proof, we assume the claim holds up to k — 1. First, we find f k G 
e-close to f k in L*. Consider the vector 

fc-i 

fn,k = V n f k - Y,^V n fi)v n/ . (5.17) 

i=\ 

We have a uniform bound on the L* n norm of v n j by Lemma 15.61 and \\V n f k — f n ,fc||2 < 
W'Pnfk ~ /lib + \\v n ,k ~ /felb, is, for large n bounded by 2s. Thus the L*-norm of the sum is 
bounded by ce. By the uniform bound (v, H n v) < c||v||^ n of Lemma [5761 and the variational 
characterization in finite dimensions we also have that 

,. \ i • \fn,k>H n f nk ) .. \P n f k ,H n 'P n f k ) , . . 

hmsupAn,* < hmsup— — — = hm sup — — — — — — + o e (l), (5.18) 

ra->oo \Jn,k, Jn,k) n-*oo \f nJk) ' nJk) 

where o e (l) — > as e — > 0. Then (15.141) of Lemma [5.71 provides 

hm (V n flH n V n ft) = ^f £ k ,Hf k y, 
and therefore the right hand side of (15.181) equals 

^ikErni + 0e(1) = -<h,Hf k >- + 0e(1) 

(fk'fk) (/*' fk) 

Now letting e — > the right hand side converges to ~<fk,Hf k y/(f k ,f k ) = A k . We have 
shown A n>fc -> A fc . 

Lemma IS~8l implies that any subsequence of the v n ,k has a further subsequence converging 
in L 2 to some g e L* satisfying i/g = A fe #. Thus (7 = f k , and so t> n fc — > L 2 f k . □ 
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6 CLT and tightness for tridiagonal /3-ensembles 

At last we verify that the /3-Hermite and Laguerre ensembles satisfy the conditions fl5.2p - fl5.6l) 
of Theorem 15.11 and so complete the proofs of Theorems 11.11 and 11.41 

The following theorem is what we need from the far more general Theorem 7.4.1 on page 
354 in Ethier and Kurtz [T5|. Denote Ay nfc = y n ^ k — y n ^-\. 

Corollary 6.1. Let a G R and h e C*i(R + ), and let y n be a sequence of processes with 
Vn,o — and independent increments. Assume that 

m n EAy njk = ti(k/m n ) + o(l), m n E(A y n ^ k ) 2 = a 2 + o(l), m n E(A y n)k ) 4 = o(l) 

uniformly for k/m n on compact sets as n — > oo. Then y n {t) = y n ,\tm n \ converges in law, with 
respect to the Skorokhod topology, to the process h(t) + abt, where b is standard Brownian 
motion. 

Proof. The time-homogeneity required in the theorem can be replaced by introducing a 
space coordinate recording time. The supremum increment bound of the theorem follows 
from Markov's inequality and the fourth moment bound here. □ 



The /3-Hermite case 

Starting with the scaled Hermite matrix ensembles H n = H@, we identify m n = n 1 ^ . After 
rearranging some terms we find 

k 

Vnxk = w n ^ k =-n- 1 /\2/(3) l ' 2 Y,9^ 
k 

Vn,2,k = n~ 1/& ^yi(yn- -j=Xp(n-e)j- 

Also, by choosing r) n ^,k — — 2[3^ 1 ^ 2 ¥.Xf3{n-k) both w n> i t k and w nj 2,k are independent- 
increment martingales. Using the notation and results of Corollary 16.11 and standard moment 
computations for the normal and gamma distributions, we get the following. 

Lemma 6.2. As n — ► oo for the Skorokhod topology we have, in law 

y nti (-) (2/(3)^ 2 b x + x 2 (t-l), i = 1, 2. 
Independence of the i — 1, 2 cases now implies (15.21) of Assumption 1. 
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Lemma 6.3. The bounds (15.41) . (15.51) of Assumption 2 hold with r)(x) 
Proof. There is the estimate 

VF(1 - 4/r) < E Xr = v / 2 r(( l + 1 . ) / 2) < ^ 



x. 



r(r/2) 

and, with again ^ nj 2,fc = 2^/n — 2f3~ 1 / 2 E,Xf3( n -k), it follows that 

A;n~ 1/2 - c < r/ n> 2,fc < 2kn~ 1/2 + c, 

where c depends on (3 only. □ 
Lastly, for (15.61) of Assumption 2, it suffices to prove a tight random constant bound on 

SUp k 6 ' 1 SUp \w n)i)kmn+ t - W n ^ km \ 2 . 
k=\...n/m n t=0...m n 

(Notice the estimate is being done in blocks.) Squaring, replacing the first supremum by a 
sum, and then taking expectations gives 

n/m n I ,4 n/m n 1 i |4 

su P^=0...m n I w n,i,km n +l ~ W n ^^km n \ ST~^ i0J ^ \ w n,i,(k+l)m n w n,i,km n \ 

U2-2S — 2-^/ 



k=l 



k=l 



k 



2- 2s 



Here we used the LP maximal inequality for martingales, see Section 2.2 of [15] . The ex- 
pectation is now bounded by a constant independent of n, k, and so is the entire sum, as 
required. 



The /3-Laguerre case 



Once again [10J provides a family of tridiagonal "/5-Laguerre ensembles", with explicit eigen- 
value densities interpolating between those at (3 = 1,2,4. Take the nxn bidiagonal random 
matrix 

Xp{n-1) X/3(k-1) 



W 13 



VP 



(6.1) 



X/32 Xp(n-ri+2) 

X/3 X/3(K-n+l) 

where the entries are all independent \ variables of the indicated parameter. Here k e R 
and necessarily k > n — 1. Then, by [10], the eigenvalues of {W£ K y(W£ K ) have joint density 

(USD. 
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While the above puts k > n — 1, the obvious duality reproduces all known real and 
complex {(3 = 1, 2) results for any limiting ratio of dimensions k — > oo and n — > oo. This (3 
family generalizes the so-called "null" Wishart ensembles, distinguishing the important class 
of WTW^ type matrices with non-identity S. For progress on the spectral edge of the latter, 
consult [2] and [II} . 

We now proceed with the proof of Theorem 11.41 It suffices to prove the claim along a 
further subsequence of any given subsequence. This allows us to assume that k = k(ti) is an 
increasing function of n, and that n/nin) — > d G [0,1]. Begin with the matrix (16.11) . denoted 
now simply W n . The "undressed" ensemble f3W} l W n has the processes 

X/L + X|(n-1)> X/9(/e-l) + X/3(n-2)> X/3(/t-2) + X/3(n-3)> ■ ■ ■ (6-2) 
X/3(K-l)X/3(n-l), X/3(K-2)X/3(n-2), X/3(/c-3)X/3(n-3) j • • • 

(6.3) 

along the main and off- diagonals, respectively. Up to fist order, the top left corner of the 
matrix W}yV n has n + Kon the diagonal, and \Znn off-diagonal. That is, the top left corner 
of 

yrlK V / 
is approximately a discrete Laplacian. If time is scaled by m^ 1 , then space will have to be 
scaled by m 2 for this to converge to the continuum Laplacian. Now the desired convergence 
of drift and noise terms determines, up to constant factors 

m n = ( /fV ) m , H n = ^L ((Vn + ^fl n - WiW n ) . (6.4) 



Now the y's are defined by formula (15.31) . and are just partial sums of shifted and scaled 
versions of (16.21) and (16. 3p . That is, 

Tfl f \ 

Ay n ,i,k = -^=(n + K-/T (xl<n-k) +X^-fe+l))), 

A«/n,2,fc = -7= 2( - /3 _1 X/3(n-fc)X/3( K -fc))- 

\/TIKj V / 



As before, we set 77 to be the expected increments and w to be the centered y. The 

i = 1,2 are again independent increment processes, though they are not independent of one 

another. We set 7 = lim^^ 2^Jn/ n/(^Jn/ k + l) 2 G [0, 1/2]. Then with x = m n /k we have 

m n EAy hntk = 71 + 0(1), 
m n E(Ay hntk ) 2 = Lll + (1), m„E(A ^ n>fc ) = o(l), 
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uniformly for k/m n in compacts, so Corollary 16.11 shows that y n ,i(%) converges to the process 
^/j/\/2^b x + 7x 2 /2, whence it is tight. Similarly we get the process convergence y n ^{x) =^ 
VT^/VWb x + (l-i)x 2 /2. 

To get the convergence of the sum and (|5.4p . we instead consider the process defined by 

noting that the process y n ^ — y n ,i,k — y n ,2,k is in sub-scaling and hence converges to the 
process in law by a fourth moment bound. Now y n ^ has independent increments, and the 
same brand of moment computations already considered along with Corollary 16.11 imply that 
it converges to 2/y/]3b x + x 2 /2. 

Towards tightness, we set rj n ^^ = m n EA?/j jn) fc. A bit of work shows that for (3 fixed and 
all k > 1 k, n > 10 we have 

k k 

Ci— < m n (r) n i k + T]n,2,k) < c 2 — 

m n m n 

We also have the upper bound m n (r] n ^,k) < 2m 2 . This verifies (15. 4R and ( 15.51) with fj(x) = x. 
The verification of the oscillation bounds (15.61) is identical to the /3-Hermite case. Indeed, 
all we used there was that sjvn n vj n ^^ are martingales whose increments are independent and 
have bounded fourth moments. 
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